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Abstract — In this paper, an approach for automatic cloud 
detection and localization in satellite remote sensing images is 
introduced. Cloud detection is useful in improving the accuracy 
of land cover classification in cloudy satellite images. The 
accurate detection of clouds in satellite images is vital for many 
atmospheric and terrestrial applications. In this paper we 
propose an algorithm for automatic cloud detection based on 
neutrosophic set and wavelet transform. The proposed approach 
uses both color and texture features for cloud detection. The 
input image is transformed into Lab color model for extracting 
the color features and gray scale image for extracting the texture 
features. Transformed images are converted into neutrosophic 
domain. An indeterminacy reduction operation is performed for 
getting better results. Finally a Fuzzy C-means clustering is 
performed on the true subsets. This gives the cloud detected 
image. This method is efficient in detecting thick clouds and thin 
clouds in Landsat images. Result analysis shows that the 
proposed algorithm can effectively detect the thin cloud. The 
proposed algorithm gives accurate results in less time complexity. 

Keywords — Cloud detection; Neutrosophic Set; Color 
transformation; Fuzzy Clustering; Landsat ETM+ 

I. Introduction 

When the world’s first satellite pictures of the atmosphere 
were viewed, the most remarkable feature was the extensive 
cloud cover over large parts of earth. Even today, the first 
feature that one usually notices in the satellite image is the 
presence of clouds. Persistent cloud covers over many regions 
cause difficulties in remote sensing with optical satellite 
imagery. Globally, land scenes are, on average, about 35% 
cloud covered, as reported by Ju and Roy [1], indicating that 
cloud covers are usually present in optical satellite images. 
This phenomenon limits the usage of optical images and 
increases the difficulty in image analysis. Cloud detection is 
useful in improving the accuracy of land cover classification in 
multispectral images. Clouds may seem random in shape and 
distribution. Cloud cover is a big challenge in optical remote 
sensing of the earth surfaces, especially over the humid tropical 
regions. Clouds are generally characterized by higher 
reflectance and lower temperature than the underlying earth 
surface. The brightness of clouds in a satellite image is one of 
the best hints to use. High brightness values in visible satellite 
images are associated with thick clouds, which tend to reflect 
much of the sun’s light. Therefore thick clouds appear white in 



visible satellite images. Thin clouds appear darker or even are 
transparent in visible satellite imagery. Automated cloud 
detection methods use spatial analysis methods to detect the 
contrast between reflected energy from clouds and nearby 
scenes and can be used to determine the extent of cloud cover 
over a region. 

The rest of the paper is organized as follows. Section II 
describes literature review. Section III explains about 
neutrosophic set. Section IV discusses the proposed cloud 
detection algorithm. Section V demonstrates experimental 
results and analysis. Conclusion is given in Section VI. 

II. Literature Review 

Cloud detection is a one of the most key processes in 
weather prediction and regional planning. Detection of clouds 
helps to improve the efficiency in study of climatic changes. 
Wang et. al. [2] proposed a scheme to remove clouds and their 
shadows from remotely sensed images of Landsat TM. The 
cloud regions can be detected on the basis of the reflectance 
differences with the other regions. An efficient and robust 
neural network-based scheme is introduced by Azimi-Sadjadi 
[3] to perform automatic cloud detection and classification. 
This method uses an unsupervised Kohonen neural network to 
classify the cloud contents of an 8*8 blocks in an image into 
ten different cloud classes. Inputs to the network consist of 
texture features of each blocks extracted using wavelet 
transform (WT). Salvador et al. [4] adopted pseudo- Wigner 
distribution (PWD) to detect the cloud. An automated cloud 
cover assessment (ACCA) algorithm [5] has been developed 
for estimating the percentage of cloud cover in Landsat 7 
Enhanced Thematic Mapper Plus (ETM+) images. Jiao et al. 
[6] proposed a Data Ratio (IRDR) algorithm based an intensity 
threshold and the method of maximum likelihood estimation to 
classify cloud-covered and cloud-free area. The cloud detection 
algorithm proposed by Yu Tong et al. [7] detects clouds and 
shadows in remote sensed images based on MSER (Maximally 
Stable Extremal Regions). Huang et al.[8] uses clear view 
forest pixels as a reference to define cloud boundaries for 
separating cloud from clear view surfaces in a spectral- 
temperature space for detecting clouds. Since this algorithm is 
proposed for forest change analysis, it requires forest pixels to 
determine the cloud boundaries, this method can be applied 
only to images where forest pixels exist. Neural network-based 
cloud classification on satellite imagery using texture features 
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is developed by Bin Tian[9]. In that method, several image 
transformation schemes such as wavelet transform (WT) and 
singular value decomposition (SVD) are used to extract the 
salient texture feature of the data and is then compared with 
those of the well-known gray-level co-occurrence matrix 
(GLCM) approach. Cheng and Y. Guo [10] proposed an 
approach for image segmentation based on neutrosophic set. 
But, this method works only for gray scale images. 

In this paper, a new approach is proposed for cloud 
detection using neutrosophic set and wavelet transform. 
Neutrosophic set provides a powerful tool to deal with the 
indeterminacy of the image. The clouds can be detected 
effectively by minimizing the indeterminacy. This method is 
applicable to detect thin and thick clouds in Landsat images. 
The goal of the proposed work is to automatically detect clouds 
using color and texture features in Landsat images. The 
proposed algorithm is efficient and detects most of the clouds 
in Landsat images. The proposed algorithm gives accurate 
detection of clouds in Landsat images. 

III. Neutrosophic Set 

Neutrosophic set (NS), is proposed by Florentin 
Smarandache as a new branch of philosophy dealing with the 
origin, nature and scope of neutralities, as well as their 
interactions with different ideational spectra [11]. In 
neutrosophy theory, every event has not only a certain degree 
of truth, but also a falsity degree and an indeterminacy degree 
that have to be considered independently from each other. 
Thus, a theory, event, concept, or entity, {A} is considered 
with its opposite {Anti-A} and the neutrality {Neut-A}. {Neut- 



A} is neither {A} nor {Anti-A}. The {Neut-A} and {Anti-A} 
are referred to as {Non-A}. According to this theory, every 
idea {A} is neutralized and balanced by {Anti-A} and {Non- 
A} [10], 

Define T, I, and F as neutrosophic components to represent 
{A}, {Neut-A}, and {Anti-A}. Let T, I, and F are real standard 
or non-standard sets of ]'0, l + [ with supT = t_sup, infT = tinf, 
supl = i sup, infl = iinf supF =f_sup, infF =f_inf and n sup 
= tsup + is up +f_sup, n_inf= tinf + iinf +f_inf[ 10]. 7} /, 
and F are not necessarily intervals, but may be any real sub- 
unitary subsets. They are set-valued vector functions or 
operations depending on known or unknown parameters and 
may be continuous or discrete. Moreover, T, I, and F may 
overlap or be converted from one to the other. An element A (t, 
i, f) belongs to the set in the following way: it is t true (t e T), i 
indeterminate (i e I) and /false (f e F), where t, i, and /are real 
numbers in the sets T, /, and F [12]. 

IV. Cloud Detection Algorithm-Proposed Approach 

Remote sensing images are very important remote sensing 
data sources. Due to the influence of weather, cloud cover is 
one of the common obstacles for data analysis in the satellite 
images. So we propose an automatic approach for cloud 
detection. The proposed algorithm consists of following steps: 
Color transformation. Wavelet decomposition. Texture 
extraction. Clustering and Morphological operations. 

Block Diagram of proposed method is shown below. 




Fig 1 Block diagram 
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The algorithm is summarized as below: 

Step 1 : Convert image from RGB color space to Lab color 
space. 

Step 2: Obtain gray scale image from RGB color space. 

Step 3: Use wavelet transformation to decompose the gray 
scale image into sub-bands (LL, LH, HL, and HH). 

Step 4: Calculate the mean of LH, HL sub-bands and the 
gray scale image using Eqs. (1)- (3). 

Step 5: Transform the L color channel and the mean values 
into NS domain independently using Eqs. (4)- (8). 

Step 6: Perform the indeterminacy reduction operation on 
the true subsets using Eqs. (9) and (10). 

Step 7: Apply Fuzzy C-means clustering to the true subset 
using Eq (11). 

Step 8: Find the absolute difference between input and 
brightness corrected reference image. 

Step 9: Perform Morphological operations. 

In this paper, we propose a cloud detection algorithm based 
on wavelet transform and neutrosophic sets [13,14]. The clouds 
are detected based on the color and texture features. Cloudy 
satellite images generally contain the combination of color and 
texture. Therefore, combining color and texture features would 
be of significant benefit in distinguishing regions having the 
same color but different textures, and vice versa. In the first 
step, RGB input image is converted into Lab color space. The 
color features are extracted from Lab transformed image. Lab 
color is designed to approximate human vision. It gives good 
results for segmentation of color pictures. It aspires to 
perceptual uniformity, and its L component closely matches 
human perception of lightness. So for further processes, only 
the L component is used. For extracting the texture features, 
the RGB input image is converted into gray scale image. Then, 
wavelet transform is applied to gray scale image to obtain the 
LL, LH, HL, and HH sub-bands. 2/2 biorthogonal wavelet 
decomposition is used [15], which is separable and 
computationally efficient. One-level wavelet decomposition is 
performed. Only, LH and HL sub-bands are employed for 
further processes because most of the texture information is in 
the LH and HL sub-bands. 

Then mean of LH, HL sub-bands and the gray scale image 
in a local window are calculated. Window size of 5 is used. 
The mean values are calculated for texture characterization. 
Mean can be calculated using the following equations. 

MELH(i,j) = ^r k L 2 - w/ 2 Xr=-w /2 mi + k,j + 0 (1) 

mehluj ) = ^r k il w/2 ir=- 2 w / 2 HL (i + M + 0 ( 2 ) 

MEGray(i,f) = ^Z£l w/2 5£- w /2 Gra Vd + M + 0 (3) 

where w is the size of the sliding window. MELH, MEHL 
and ME Gray are the mean values of LH, HL sub band and the 
grayscale image respectively. 



Then the pixel P(i, j) in the image domain for the L channel 
and the calculated mean values are transformed into the 
neutrosophic domain [16,17], P NS (i, j) = {T(i, j), I(i, j), F(i, j)}. 
T (i, j), I (i, j) and F (i, j) are the membership value belonging 
to true set, indeterminate set and false set, respectively. NS is 
used to describe the indeterminacy in the color texture image. 
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where £f(t,7) is the local mean value of the image. 5(i,y) is 



the absolute value of the difference between intensity g(i, j) and 
its local mean value g(i,j) at (i, j). 

The indeterminacy of the NS image Pns is decreased for 
getting better results. Indeterminacy reduction is performed on 
subset T of L channel and the mean values 
(MELH,MEHL,MEGray) using the following equations. 
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indeterminacy reduction operation and T (i,/) is the mean of 
the true set. 

After the indeterminacy reduction operation, the 
membership set T becomes more distinct and has high contrast. 
So the clouds can be detected more effectively. 

Finally a clustering is performed on the indeterminacy 
minimized data. Clustering is the process of dividing data 
elements into classes so that items in the same class are similar 
as possible, and items in different classes are as dissimilar as 
possible. Here Fuzzy C-means clustering is used. The elements 
in subset T for L color channel and the mean values are 
combined as the input for the Fuzzy C-means clustering [18]. 
The input of the clustering algorithm is a vector X — 
[ r ^L NS ' , E MELHNS ,T MEHLNS ,T MEGra y NS ]At is based on 
minimization of the following objective function: 

lm = Yd=lY. C j=lUij \\x t -Cj \\ 2 ,1 < m < oo (11) 

where m is any real number greater than 1, Uy is the degree 
of membership of x,- in the cluster j, x, is the /th of d- 
dimensional measured data, Cj is the d-dimension center of the 
cluster, and ||*|| is any norm expressing the similarity between 
any measured data and the center. Number of clusters is 
selected as 2. After experimental analysis we have found that 
cluster number 2 gives the best results. Then label every pixel 
in the image using the results from Fuzzy C-Means. For every 
object in the input, Fuzzy C-Means returns an index 
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corresponding to a cluster. Label every pixel in the image with 
its cluster index and the cluster contains cloud pixels are taken 
for further analysis. The cluster index for cloud pixels is found 
out and that cluster index is used for generating a binary cloud 
mask that contains cloud pixels represented as white and non 
cloud pixels represented as black. 

Now we have a binary cloud mask that contains possible 
cloud pixels and we have to eliminate the non cloud pixels. It 
can be done by calculating the absolute difference between 
intensity of target and brightness corrected reference image. If 
it is greater than threshold T then those pixels are treated as 
actual cloud pixels. The threshold T is experimentally set to 25. 
Target image ( I T ) and reference images ( I R ) are two images 
which are observed at different times but cover the same 
region. In this paper we assume that the target and reference 
images are already registered. There is a slightly or heavily 
difference in the brightness of the same-location images 
acquired at different time due to the atmospheric effects, sun 
angles, and sensor looking angles. As a result, brightness 
correction [19] is performed to achieve consistency in the mean 
and standard deviation of the intensity values of target and 
reference images. In this process, the mean and standard 
deviation of reference image is linearly transformed into the 
mean and standard deviation of target image. The brightness 
correction is formulated as follows: 

Ir — Or M/ r ) x ~~ + M/ t (12) 

°>R 

where I R is the reference image, n, R is mean of reference 
image, p Ir is mean of target image, a, T is standard deviation of 
target image, <j Ir is the standard deviation of reference image 
and l' R is the brightness corrected reference image. 

The cloud-contaminated pixels are excluded in the 
calculation of equation 12. The reference image (l R ) is image 
captured over same area as that of target image but on different 
date. We select image with least cloud cover as reference 
image. 

In order to avoid small misclassifications in cloud 
detection, we perform morphological erosion operation with a 
structuring element square of size 3. To select thin clouds on 
the cloud boundaries, a dilation operation with structuring 
element size 9 is applied to the detected cloud mask. Finally a 
binary decision map of same size as input image is created to 
record the detected results of locations of cloud regions. 

V. Experimental Results 

The data considered in this work used a set of images 
acquired with the Landsat-7 ETM+ sensor. All images used in 
this work are cropped to 512x512 pixels. 

Landsat is one of the most widely used satellite data 
sources for local, regional, and global applications because of 
its medium spatial resolution, multispectral bands, and long 
record of historical data. Landsat images provide the longest 
satellite observations of the Earth’s land surface that are 
important to a wide range of remote sensing applications. 
However, due to the nature of optical sensors and the relative 



long revisit cycle. Thick-cloud contamination is a common 
problem in Landsat images, which limits their utilities in 
various land surface studies. Landsat images are highly 
affected by clouds, which present a serious obstacle in their 
applications, especially in monitoring land surface dynamics. 

In figure 2 (a)-(e) the different cloudy Landsat images and 
corresponding cloud detected images are shown. The detected 
cloud regions are enclosed in red boundary. The results are 
shows that the proposed cloud detection algorithm can 
accurately detect almost all thick and thin clouds. 

The evaluation of cloud masks is difficult because ground 
truth is not available to compare the cloud mask. As no ground 
truth is available, the performance of the proposed algorithm is 
evaluated subjectively. Performance evaluation is performed 
quantitatively by the numbers of clouds detected correctly 
(NOC) and the precision of clouds detected (POC) [7], 

The precision of the clouds detected is calculated as 
follows: 

POC =—*100 (13) 

NOM 

where NOM is the number of clouds detected manually. 

In fig 3 (a)-(b) the different cloudy Landsat images (i) and 
corresponding cloud detected images by proposed 
algorithm (ii), direct thresholding (iii),and by Wang et.al (iv) 
are shown. The detected cloud regions are enclosed in red 
boundary in (ii) and (iv) and in magenta boundary in (iii). To 
detect clouds by thresholding, threshold value is 
experimentally set to 1 80 and only red band of Landsat image 
is used. The results are shows that the proposed cloud 
detection algorithm can accurately detect almost all thick and 
thin clouds. 

Table I record the total number of clouds detected by the 
proposed algorithm (NOC), total number of clouds detected 
manually (NOM) and precision (POC) of clouds detected. 

TABLE I. 

Evaluation Results of CLoud detected results of Fig 3(a)-(b) 



Image 


NOC 


NOM 


POC (in %) 


a 


3 


3 


100 


b 


6 


6 


too 



The images used in this result analysis are of size 256 x 256 
pixels. In fig 3(a) all clouds are detected, number of clouds 
detected manually is 3 and number of clouds detected by the 
algorithm is also 3 and result in 100 % precision. 

Table II represents the total number of pixels detected as 
cloud and total number of regions detected as cloud by the 
proposed method, thresholding and Wang et.al mehod. In fig 
3a (ii) total number of pixels detected as cloud by proposed 
approach is 1296, in fig 3a (iii) clouds detected by thresholding 
is 354 and in fig 3a (iv) cloud pixels detected by Wang et.al’s 
method is 554. Only a small number of pixels are detected as 
cloud by thresholding and by Wang et.al method. 
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Fig 2 (a) (b) (c) (d) (e) Cloudy input images and 
corresponding cloud detected images 




Fig 3(a) (b)(i) Cloudy input image corresponding (ii) 
cloud detected image by proposed approach (iii) thresholding (iv) and 
by Wang et.al 



TABLE II. 

Evaluation Results of CLoud detected results of Fig 3(a)-(b) 



Image 


Total Number of pixels 
detected as Cloud 


Total Number of regions 
detected as Cloud 


Proposed 

Approach 


Thres 

holding 


Wang 

et.al 


Proposed 

Approach 


Thres 

holding 


Wang 

et.al 


a 


1296 


354 


554 


5 


8 


3 


b 


30657 


32916 


40073 


27 


62 


14 



In fig 3b (ii) total number of pixels detected as cloud by 
proposed approach is 30657, in fig 3b (iii) clouds detected by 
thresholding is 32916 and in fig 3b (iv) clouds detected by 
Wang et.al is 40073. Here in fig 3b (iii) and (iv) some non 
cloud pixels are also detected as cloud. In fig 3a (ii) total 
number of regions detected as cloud by proposed approach is 
5, in fig 3a (iii) clouds detected by thresholding is 8 and by 
Wang et.al is 3. Thresholding results in cloud segments 
contain only few pixels. So the number of regions detected is 
more than that of the proposed method. 

VI. Conclusion 

This paper presents a new approach for detecting clouds in 
Landsat satellite imagery. The approach uses neutrosophic set 
and wavelet transformation. Then mean of the sub bands and 
grayscale image are calculated for extracting the texture 
features. The experimental results shows that our proposed 
approach yield good detection result, subjectively and 
quantitatively. The proposed algorithm is tested with Landsat 
images. The experimental results have demonstrated that the 
proposed algorithm classifies cloud pixels more accurately in 
time. 
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